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^5 Abstract 

^^ Is the large influence that mutual funds assert on the U.S. financial 

,_i system spread across many funds, or is it is concentrated in only a few? 

f-H We argue that the dominant economic factor that determines this is 

Xy^ market efflciency, which dictates that fund performance is size inde- 

pendent and fund growth is essentially random. The random process 
is characterized by entry, exit and growth. We present a new timc- 



I dependent solution for the standard equations used in the industrial 



organization literature and show that relaxation to the steady-state 
solution is extremely slow. Thus, even if these processes were station- 
ed ary (which they are not), the steady-state solution, which is a very 
^ heavy-tailed power law, is not relevant. The distribution is instead 

well-approximated by a less heavy-tailed log-normal. We perform an 
Jq empirical analysis of the growth of mutual funds, propose a new, more 

fT^ accurate size-dependent model, and show that it makes a good pre- 

, I diction of the empirically observed size distribution. While mutual 

f~^ funds arc in many respects like other firms, market efficiency intro- 

00 duces effects that make their growth process distinctly different. Our 

^D work shows that a simple model based on market efficiency provides a 

j>. good explanation of the concentration of assets, suggesting that other 

: '"j effects, such as transaction costs or the behavioral aspects of investor 

rN choice, play a smaller role. 
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1 Introduction 



In the past decade the mutual fund industry has grown rapidly, moving 
from 3% of taxable household financial assets in 1980, to 8% in 1990, to 
23% in 2007q In absolute terms, in 2007 this corresponded to 4.4 trillion 
USD and 24% of U.S. corporate equity holdings. Mutual funds account 
for a significant fraction of trading volume in financial markets and have 
a substantial influence on prices. This raises the question of who has this 
influence: Are mutual fund investments concentrated in a few dominant 
large funds, or spread across many funds of similar size? Do we need to 
worry that a few funds might become so large that they are "too big to 
fail" ? What are the economic mechanisms that determine the concentration 
of investment capital in mutual funds? 

Large institutional investors are known to play an important role in the 
market 



Corsetti et al. , 2001 



Gabaix et al. recently hypothesized that the 
fund size distribution plays a central role in explaining the heavy tails in 
the distribution of both trading volume and price return^ If their theory 
is true this would imply that the heavy tails in the distribution of mutual 
fund size play an important role in determining market risk. 

While it is standard in economics to describe distributional inequalities 
in terms of statistics such as the Gini or Herflndahl indices, as we show in 
Appendix A, this approach is inadequate to describe the concentration in 
the tail. Instead, the best way to describe the concentration of assets is 
in terms of the functional form of the tail. As is well-known in extreme 
value theory [Embrechts et ah 1997| , the key distinction is whether all the 
moments of the distribution are flnite. If the tail is truly concentrated, the 
tail is a power law, and all the moments above a given threshold, called the 
tail exponent, are inflnite. So, for example, if the tail of the mutual fund 
size distribution follows Zipf's law as hypothesized by Gabaix et al., i.e. if 
it were a power law with tail exponent one, this would imply nonexistence 
of the mean. In this case the sample estimator fails to converge because 
the tails are so heavy that with significant probability a single fund can be 
larger than the rest of the sample combined. This is true even in the limit 
as the sample size goes to infinity. Thus power law tails imply a very high 



^ Data is taken from the Investment Company Institute's 2007 fact book available at 
www.ici.org. 

^ The equity fund size distribution was argued to be responsible for the observed 

and 



distribution of trading volume Levy et al 
Gabaix et al 
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have argued that it is important for explaining the distribution of price 



returns Gabaix et al. 



2003 Gabaix et al. 



2006 



degree of concentration. 

Instead, empirical analysis shows that the tail of the mutual fund size 
distribution is not a power law, and is well-approximated by a lognormal 
[Schwarzkopf and Farmer 2010 . Thus, while the distribution is heavy tailed. 



it is not as heavy tailed as it would be if the distribution were a power law. 
The key difference is that for a log- normal all of the moments exist. 

This naturally leads to the question of what economic factors determine 
the tail properties of the mutual fund distribution. There are two basic types 
of explanation. One type of explanation is based on a detailed description 
of investor choice, and another is based on efficient markets, which predicts 
that growth should be random, and that the causes can be understood in 
terms of a simple random process description of entry, exit and growth. Of 
course market efficiency depends on investor choice, but the key distinction 
is that the random process approach does not depend on any of the details, 
but rather only requires that no one can make superior investments based 
on simple criteria, such as size. 

Explanations based on investor choice can in turn be divided into two 



types: Rational and behavioral. For example. Berk and Green 2004 have 
proposed that investors are rational, making investments based on past per- 
formance. Their theory implies that the distribution of fund size is deter- 
mined by the skill of mutual fund managers and the dependence of trans- 
action costs on size. If we assume, for example, that the transaction cost is 
a power law (which includes linearity) if the distribution of fund size is log- 
normal, then it is possible to show that the distribution of mutual fund skill 
must also be log-normal. Unfortunately, without a method of measuring 
skill this is difficult to test. 

Another type of explanation is behavioral, i.e. that investors are strongly 
influenced by factors such as advertising, fees, and investment fadq^ We 
strongly suspect that this is true, and that they play an important role in 
determining the size of individual funds. The question we investigate here 
is not whether such effects exist, but whether they are essential to explain 
the form of the distribution. 

The alternative is that the details of investor choice don't matter, and 
that the distribution of fund size is driven by market efficiency, which dic- 
tates an approach based on the random process of entry, exit and growth. 
The random process approach was originally pioneered as an explanation 
for firm size by Gibrat, Simon and Mandelbrot, and is popular in the indus- 



Barber et al. 



2005 



have found that investors flows are correlated to marketing and 



advertising while they are not correlated to the expense ratio. 



trial organization literaturq^ The basic idea is that while details of investor 
choice are surely important in determining the size of individual funds, the 
details may average out or be treatable as noise, so that in aggregate they 
do not matter in shaping the overall size distribution. 

On the face of it, however, there seems to be a serious problem with 
this approach. Under simple assumptions about the entry, exit and growth 
of fund size, Gabaix et al. [2003 showed that the steady state solution is 
a power law; a similar argument is described in Montroll and Shlesinger 
1982 and Reed 2001p As already mentioned, however, the upper tail 
of the empirical distribution is a log-normal, not a power law. Thus there 
would seem to be a contradiction. Apparently either the correct random 
process is more complicated, or this whole line of attack fails. 

We show here that the central problem comes from considering only 
the steady state (i.e. infinite time) solution. We study the same equations 
considered by Gabaix et al. and Reed, but we find a more general time- 
dependent solution, and show that the time required to reach steady state 
is very long. The mutual fund industry is rapidly growing and, even if the 
growth process had been stationary over the last few decades, not enough 
time has elapsed to reach the stationary solution for the fund size distribu- 
tion. In the meantime the solution is well approximated by a log-normal. 
This qualitative conclusion is very robust under variations of the assump- 
tions. In contrast to the hypothesis of Berk and Green, it does not depend 
on details such as the distribution of investor skill - the log-normal property 
emerges automatically from market efficiency and the random multiplicative 
nature of fund growth. 

To test our conjectures more quantitatively we study the empirical prop- 
erties of entry, exit and growth of mutual funds, propose a more accurate 
model than those previously studied, and show it makes a good prediction 
of the empirically observed fund size distribution. The model differs from 
previous models in that it incorporates the fact that the relative growth 
rate of funds slows down as they get biggeiQ This makes the time needed 
to approach the steady state solution even longer: Whereas the relaxation 
time for the size- independent diffusion model is several decades, for the more 
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accurate size-dependent model it is more than a century. 

Market efficiency is the key economic principle that makes the random 
process model work, and dictates many of its properties. It enters the story 
in several ways. (1) The fact that stock market returns are essentially ran- 
dom implies that growth fluctuations are random, for two reasons: (a) With- 
out inflows and outflows, under the principle that past returns are not in- 
dicative of future returns, fund growth is random, (b) Although investors 
chase past returns, since what they are chasing is random, fund growth due 
to inflow and outflow is random on sufficiently long time scales. (2) Effi- 
ciency dictates that mutual fund performance must be independent of size. 
Thus as mutual funds randomly diffuse through the size space, there is no 
pressure pushing them toward a particular size. (3) Efficiency, together with 
the empirical fact that the relative importance of fund inflows and outflows 
diminishes as funds get bigger, implies that the mean growth rate and the 
growth diffusion approach a constant in the large size limit. As we show, 
this shapes the long-term properties of the size distribution. All of these 



points are explained in more detail in Section 4.3 

Market efficiency makes mutual funds unusual relative to most other 
types of firms. For most firms, in the large size limit the mean and standard 
deviation of the growth rate are empirically observed to decay to zero. For 
mutual funds, in contrast, due to market efficiency they both approach a 
positive limit. This potentially affects the long-term behavior: Most firms 
approach a solution that is thinner than a log-normal, i.e under stationary 
growth conditions their tails are getting thinner with time, whereas mutual 
funds approach a power law, so their tails are getting fatter with time. 
Nonetheless, as we have already mentioned, even under stationary growth 
conditions the approach to steady-state takes so long that this is a moot 
point. 

At a broader level our work here shows how the non-stationarity of 
market conditions can prevent convergence to an "equilibrium" solution. 
Nonetheless, even under stationary conditions the random process model 
usefully describes the time-dependent relationships between entry, exit and 
growth phenomena on one hand and size on the other hand. While we can- 
not show that the random process model is the only possible explanation, 
we do show that it provides a good explanatiorQ The conditions for this 
are robust, depending only on market efficiency, without the stronger re- 



"^ While variations in the assumptions about the random process preserve certain quali- 
tative conclusions, such as the log-normal character of the upper tail, we found that getting 
a good fit to the data requires a reasonable degree of fidelity in the modeling process. The 
size-dependent nature of the diffusion process, for example, is quite important. 



quirements of perfect rationality, or the complications of mapping out the 
idiosyncrasies of human behavior. 

The paper is organized as follows. In Section [2] we develop the standard 



exit and entry model. Section 2.1 presents the time-dependent solution for 
the number of funds, Section 2.2 presents the time-dependent solution for 
the size distribution, and Section 2.3 introduces a size-dependent model. 
Section |3] describes our data. In Section [4] we perform an empirical analysis 
to justify our assumptions and to calibrate the model. In Section [5] we 
present simulation results of the proposed model and compare them to the 
empirical data. Finally Section [6] presents our conclusions. 

2 Model 

Our central thesis in this paper is that due to market efficiency the mutual 
fund size distribution can be explained by a stochastic process governed by 
three key underlying processes: the size change of existing mutual funds, the 
entry of new funds and the exit of existing funds. In this section we introduce 
the standard diffusion model and derive a time-dependent solution for the 
special case when the diffusion process has constant mean and variance. We 
then make a proposal for how to model the more general case where the 
mean and variance depend on size. 

The aim of the model we develop here is to describe the time evolution 
of the size distribution, that is, to solve for the probability density function 
p(uj,t) of funds with size s at time t, where u = logs. The size distribution 

can be written as 

n(uj, t) , , 

where n(cj,t) is the number of funds at time with logarithmic size uj and 
N{t) = / n(a;, €)Aijj is the total number of funds at time t. To simplify the 
analysis we solve separately for the total number of funds A^(t) and for the 
number density n(w,t). 

2.1 Dynamics of the total number of funds 

As we will argue in Section |4j the total number of funds as a function of 
time can be modeled as 

= u-XN 2) 

dt ^ ' 



where v is the rate of creating new funds and A is the exit rate of existing 
funds. Under the assumption that v and A are constants this has the solution 

iV(t) = ^(l-e-^*)0(t), (3) 

where 6{t) is a unit step function at t = 0, the year in which the first funds 
enter. This solution has the surprising property that the dynamics only 
depend on the fund exit rate A, with a characteristic timescale 1/A. For 
example, for A ~ 0.09, as estimated in Section El the timescale for N{t) to 
reach its steady state is only roughly a decade. An examination of Table |T] 
makes it clear, however, that v = constant is not a very good approximation. 
Nonetheless, if we crudely use the mean creation rate v ~ 900 from Table |T] 
and the fund exit rate A k, 0.09 estimated in Section |4j the steady state 
number of funds should be about N ^ 10, 000, compared to the 8, 845 funds 
that actually existed in 2005. Thus this gives an estimate with the right 
order of magnitude. 

The important point to stress is that the dynamics for N{t) operate on a 
different timescale than that of n{uj,t). As we will show in the next section 
the characteristic timescale for n{uj,t) is much longer than that for N{t). 

2.2 Solution for the number density n{u,t) 

We define and solve the time evolution equation for the number density 
n{u},t). The empirical justification for the hypotheses of the model will be 
given in Section [4j The hypotheses are: 

• The entry process is a Poisson process with rate i', such that at time 
t a new fund enters the industry with a probability udt and (log) size 
uj drawn from a distribution f(uj,t). We approximate the entry size 
distribution as a log- normal distribution in the fund size s, that is a 
normal distribution in lo given by 

f{u:,t) = ^exp (_t^^\ 9{t-to), (4) 

where loq is the mean log size of new funds and o"^ is its variance. 
G{t — to) is a unit step function ensuring no funds funds enter the 
industry before the initial time to- 

• The exit process is a Poisson process such that at any time time t a 
fund exits the industry with a size independent probability Adt. 



• The size change is approximated as a (log) Brownian motion with a 
size dependent drift and diffusion term 

dw = /x(a;)dt + a{uj)dW, (5) 

where dW is an i.i.d random variable drawn from a zero mean and 
unit variance normal distribution. 

Under these assumptions the forward Kolmogorov equation (also known 
as the Fokker-Plank equation) defining the time evolution of the number 
density [Gardiner 2004 is given by 



d d d'^ 

Q^ri{uj, t) = uf{uj, t) - Xn{uj, t) - q- [K^)n{uj, t)] + —^ [D{uj)n{uj, t)] , (6) 

where D{uj) = a{io)'^/2 is the size diffusion coefficient. The first term on the 
right describes the entry process, the second describes the fund exit process 
and the third and fourth terms describe the change in size of a existing 
funds. 

2.2.1 Approximate solution for large funds 

To finish the model it is necessary to specify the functions /^(w) and D{oj). 
It is convenient to define the relative change in a fund's size As(t) as 

such that drift and diffusion parameters in our model are given by 

/x(lj) = E[log(l + A,)] D{uj) = ^Var[log(l + A,)]. 

The relative change can be decomposed into two parts: the return A^ and 
the fractional investor money flux Aj(t), which are simply related as 

A,(t) = Aj(i) + A,(f). (8) 

The return A^ represents the return of the fund to its investors, defined as 

A it) - ^^-4y(t + 1) - NAVjt) 



where NAV{t) is the Net Asset Value at time t. The fractional money flux 
Aj(i) is the change in the fund size by investor deposits or withdrawals, 
defined as 

A/(t) = j^^ . (10) 

In Section |4] we will demonstrate empirically that the returns A^ are 
independent of size, as they must be for market efficiency. In contrast the 
money flux Af decreases monotonically with size. In the large size limit the 
returns A^ dominate, and thus it is reasonable to treat fi{s) as a constant, 
A* = A*oo- Market efficiency also implies that in the large size limit the 
standard deviation a{s) is a constant, i.e. a = cToo- Otherwise investors 
would be able to improve their risk adjusted returns by simply investing in 
larger funds. 

With these approximations the evolution equation becomes 

d d d'^ 

—n{uj,t) = uf{uj,t) - \n{uj,t) - fi—n{u;,t) + D—^n{uj,t), (11) 

In this and subsequent equations, to keep things simple we use the notation 
^ = ^^/2 and fi = fioo- 

The exit process is particularly important, since it is responsible for 
thickening the upper tail of the distribution. The intuition is as follows: 
Since each fund exits the industry with the same probability, and since there 
are more small funds than large funds, more small funds exit the industry. 
This results in relatively more large funds, making the distribution heavy- 
tailed. As we will now show this results in the distribution evolving from 
a log- normal upper tail to a power law upper tail. In contrast, the entry 
process is not important for determining the shape of the distribution, and 
influences only the total number of funds A^. This is true as long as the 
entry size distribution f{u},t) is not heavier-tailed than a lognormal, which 
is supported by the empirical data. 

In the large size limit the solution for an arbitrary entry size distribution 
/ is given by 



r ^ I /■* -\t' 1 

niuAj = u / / exp exp 



{oj-uj'-fj,t'f 
4Dt' 



f{u)',t—t') dt'duj' . 



(12) 
Stated in words, a fund of size uj' enters at time t — t with probability 
f{uj',t — r). The fund will survive to time t with a probability exp(— Ar) 



and will have a size oo at time t with a probability according to (23) 



If funds enter the industry with a constant rate v beginning at t = 0, 
with a log-normal entry size distribution f(uj,t) centered around loq with 
width a^j as given by Q, the size density can be shown to be 
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(15) 



(16) 



where Erf is the error function, i.e. the integral of the normal distribution. 
Approximating the distribution of entering funds as having zero width 
simplifies the solution. Let us define a large fund as one with a; ^ wo, where 
loq is the logarithm of the typical entry size of one million USD. For large 
funds we can approximate the lognormal distribution as having zero width, 
i.e. all new funds have the same size ujq. The number density is then given 

by 
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Since 7 > 1/4 (14), the density vanishes for both a; — t- 00 and uj — t- —00 
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2.2.2 Steady state solution for large funds 

The steady state solution for large times is achieved by taking the t 



limit of (17), which gives 
n{uj) 



V l^ ( ^ 



Uq 



7|a; -Wo 



CX) 



(18) 



Since the log size density (jTSJ) has an exponential upper tail p{u)) ~ exp(— (^s'^) 
and s = exp(a;) the CDF for s has a power law tail with an exponenlrl ^^ , 
i.e. 

P{s>X)^X-^'. (19) 



Substituting for the parameter 7 using Eq. ( 14 ) for the upper tail exponent 
yields 



6 



2D 



(20) 



Note that this does not depend on the creation rate v. Using the average 



parameter values in Table III the asymptotic exponent has the value 

Cs = 1.2±0.6. (21) 

This suggests that if the distribution reaches steady state it will follow Zipf 's 
law, which is just the statement that it will be a power law with C,s ~ ^■ 
As discussed in the introduction, this creates a puzzle, as the empirical 



distribution is clearly log-normal [Schwarzkopf and Farmer, 2010 



2.2.3 Timescale to reach steady state 

Since we have a time dependent solution we can easily estimate of the 



timescale to reach steady state. The time dependence in Eq. 17 



IS con- 



tained in the arguments of the error function terms on the right. When 
these arguments become large, say larger than 3, the solution is roughly 
time independent, and can be written as 



t > 



9D 

47^2 



1 + 
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(22) 



To calculate the tail exponent of the density correctly one must change variables 



through p(s) = p{u;) 



ds 



^ . This results in a CDF with a tail exponent of ^s 
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Using the average values in Table III in units of months /x = //qo ~ 0.005, 
D = cr^/2 and Uoo ~ 0.05. This gives 



t > 180 l + Jl + 0.7\u;-ujo\ 



where the time is in months. Plugging in some numbers from Table III 
makes it clear that the time scale to reach steady state is very long. For 
instance, for funds of a billion dollars it will take about 170 years for their 
distribution to come within 1 percent of its steady state. This agrees with 
the empirical observation that there seems to be no significant fattening of 
the tail in the fifteen years from 1991 - 2005. Note that the time required for 
the distribution n(a;, t) to reach steady state for large values of u is much 
greater than that for the total number of funds N{t) to become constant. 

During the transient phase the solution remains approximately log- normal 
for a long time. If funds only change in size and no funds enter or exit, then 
the resulting distribution is normal 



h(w, t) = , exp 



{uj - fj.tf 
ADt 



(23) 



which corresponds to a size distribution p(s) with a lognormal upper tail. 
While the exit process acts quickly in changing the total number of funds, it 
acts slowly in changing the shape. This is the key reason why the distribution 
remains approximately log-normal for so long. 

2.3 A better model of size dependence 

The mean rate of growth and diffusion are in general size dependent. We 
hypothesize that the mean growth rate fj,{s) and the standard deviation a{s) 
are the sum of a power law and a constant, of the form 

as{s) = aos-^ + cToc (24) 

The constant terms come from mutual fund returns (neglecting inflow or 
outflow of funds), and must be constant due to market efficiency, as ex- 



plained in more detail in Section 4.3, The power law terms, in contrast. 



are due to the fiow of funds in and out of the market. There is a sub- 
stantial literature of proposed theories for this, including ourq^ We present 

^ There has been a significant body of work attempting to explain the heavy tails in 
the growth rate of firms and the associated size dependence in the diffusion rate. See 
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the empirical evidence for the power law hypothesis and explain the role of 



efficiency in more detail in Section 4.3.2 



The functional form given above for the size dependence can be used to 
make a more accurate diffusion model. The non vanishing drift /Xoo > and 
diffusion terms o"oo > are essential for the distribution to evolve towards a 
power law. As already mentioned, due to market efficiency E[Ar{s)] must 
be independent of s, and since E[Af(s)] is a decreasing function of s, for 
large s /i(s) = S[Ar(s)] + E[Af{s)] = fioc > 0. This distinguishes mutual 
funds from other types of firms, which are typically observed empirically to 



have idoo = o-Qo = [Stanley et al. 1996 Matia et al. , 2004] . Assuming 



that other types of ffims obey similar diffusion equations to those used here, 
it can be shown that the resulting distribution has a stretched exponential 
upper tail, which is much thinner than a power la^\ 
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3 Data Set 

We test our model against the CRSP Survivor-Bias-Free US Mutual Fund 
Database. Because we have daily data for each mutual fund, this database 
enables us to investigate the mechanism of fund entry, exit and growth to 
calibrate and test our modej^ We study the data from 1991 to 200q^ We 
define an equity fund as one whose portfolio consists of at least 80% stocks. 
The results are not qualitatively sensitive to this, e.g. we get essentially the 
same results even if we use all funds. The data set has monthly values for 
the Total Assets Managed (TASM) by the fund and the Net Asset Value 



Amaral et al.[ [19971 [Buldy rev ^^ al.|[T997||Amaral et al.| [19981 [PeFabritiis ^t aH[2003 



vlatia et al.[ |2004| |Bottazzi[ |2001[ |Sutton| |2001| |Wyart and i3ouchaud[ |2003| [Bottazzil 



and Secchi||2003bl |2005||Fu et al"! |2005||Riccaboni et aT-l |2008| [Podobnik et al.| |2008 
Our theory argues for an additive replication model, and produces predictions that fit 
the data extremely well for a diverse set of different phenomena, including mutual funds 
[Schwarzkopf et al.j |2010| . We argue that the fundamental reason for the power tails is 
the influence network of investors. 

^° A stretched exponential is of the form p{x) ~ exp(aa;~''), where a and b are positive 
constants. There is some evidence in the empirical data that the death rate A also decays 
with size. However, in our simulations we found that this makes very little difference for 
the size distribution as long as it decays slower than the distribution of entering funds, 
and so in the interest of keeping the model parsimonious we have not included this effect. 

^^ Note that we treat mergers as the dissolution of both original firms followed by the 
creation of a new (generally larger) firm. This increases the size of entering firms but does 
not make a significant difference in our conclusions. 

^^ There is data on mutual funds starting in 1961, but prior to 1991 there are very few 
entries. There is a sharp increase in 1991, suggesting incomplete data collection prior to 
1991. 
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(NAV). We define the size s of a fund to be tlie value of tlie TASM, measured 
in millions of US dollars and corrected for inflation relative to July 2007. 
Inflation adjustments are based on the Consumer Price Index, published by 
the BLS. In Table |l] we provide summary statistics of the data set and as 
seen there the total number of equity funds increases roughly linearly in 
time, and the number of funds in the upper tail Ntaii also increases. 

4 Empirical investigation of size dynamics 

In this section we empirically investigate the processes of entry, exit and 
growth, providing empirical justification and calibration of the model de- 
scribed in Section 2. 

4.1 Fund entry 

We begin by examining the entry of new funds. We investigate both the 
number A^eiiter(i) and size s of funds entering each year. We perform a 
linear regression of A^eiiter(i) against the number of existing funds N[t — 1), 
yielding slope a = 0.04 it 0.05 and intercept j3 = 750 ± 300. The slope is not 
statistically significant, justifying the approximation of entry as a Poisson 
process with a constant rate v, independent of N{t). 

The size of entering funds is more complicated. In Figure [l] we compare 
the distribution of the size of entering funds /(s) to that of all existing 
funds. The distribution is somewhat irregular, with peaks at round figures 
such as ten thousand, a hundred thousand, and a million dollars. The 
average size ^^ of entering funds is almost three orders of magnitude smaller 



than that of existing funds, making it clear that the typical surviving fund 
grows significantly after it enters. It is clear that the distribution of entering 
funds is not important in determining the upper tail^^ The value of the 
mean log size and its variance are calculated from the data for each period 
as summarized in Table Hill 



^^ When discussing the average size one must account for the difference between the 
average log size and the average size: Due to the heavy tails the difference is striking. The 
average entry log size E[tJc] ~ 0, corresponding to a fund of size one million, while if we 
average over the entry sizes E[sc] = E[e"''], we get an average entry size of approximately 
30 million. For comparison, both the average size and the average log size of existing 
funds are quot ed in Table [l| 



In Section 2.2 we showed that the entry process is not important as long as the tails of 
the entry distribution / are sufficiently thin. We compared the empirical / to a log-normal 
and found that the tails are substantially thinner. 
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Figure 1 : The probability density for the size s of entering funds in milhons 
of dollars (solid line) compared to that of all funds (dashed line) including 
all data for the years 1991 to 2005. The densities were estimated using a 
gaussian kernel smoothing technique. 



Thus the empirical data justifies the approximation of entry as a Poisson 
process in which an average of v funds enter per month, with the size of each 
fund drawn from a distribution f(uj,t). 

4.2 Fund exit 

Unlike entry, fund exit is of critical importance in determining the long-run 
properties of the fund size distribution. In Figure [2] we plot the number of 
exiting funds Nex±t{t) as a function of the total number of funds existing in 
the previous year, N(t — 1). There is a good fit to a line of slope A, which 
on an annual time scale is A = 0.092 it 0.030. This justifies our assumption 
that fund exit is a Poisson process with constant rate A. 



4.3 Fund growth 

We first test the i.i.d and normality assumptions of the diffusion growth 
model, and then test to demonstrate the size dependence of the growth 



process that we proposed in Section 2.3 We also discuss the diverse roles 
that efficiency plays in shaping the random process for firm growth in more 
detail. 
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Figure 2: The number of equity funds exiting the industry A^exit(*) in the 
year t as a function of the total number of funds existing in the previous 
year, N(t — 1). The plot is compared to a linear regression (full line). The 
error bars are calculated for each bin under a Poisson process assumption, 
and correspond to the square root of the average number of funds exiting 
the industry in that year. 



4.3.1 Justification for the diffusion model 

In the absence of entry or exit we have approximated the growth of existing 
funds as a multiplicative Gibrat-like proces^^ satisfying a random walk in 
the log size u. This implicitly assumes that A^ is an i.i.d normal random 
variable. 

The assumption of independence is justified by market efficiency, which 



requires that the returns A^ of a given fund should be random Bollen and 



Busse , 2005 ; Carhart , 1997 . Under the decomposition of the total growth 
as As = Aj. + Af, as demonstrated in the next sub-section, in the large 
size limit the returns A^, dominate, so under market efficiency the i.i.d. 
assumption is automatically valid. 

This is not so obvious for smaller size firms, where the money flux Af 
dominates the total growth A^. It is well known that investors chase past 



^^ A Gibrat-like process is a multiplicative process in which the size of the fund at any 
given time is given as a multiplicative factor times the size of the fund at a previous time. 
In Gibrat's law of proportionate effect [Gibrat[ |1931| the multiplicative term depends 
linearly on size while here we allow it to have any size dependence. 
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performance^ Even though the past performance they are chasing is ran- 
dom, if they track a sufficiently long history of past returns, this can induce 
correlations. This causes correlations in the money flux Aj, which in turn 
induces correlations in the total size change A^. 

To test whether such correlations are strong enough to cause problems 
with the random process hypothesis, we perform cross-sectional regressions 
of the form 

Af{t) = (3 + /3iA,(t - 1) + /32A,(t - 2) + . . . + fS^Arit - 6) + ^(t), (25) 

where ^(t) is a noise term. The results are extremely noisy; for example, 
when we perform separate regressions in five different periods, eight of the 
thirty possible coefficients Pi shown in Table [IT] are negative and only two of 
them are significant at the two standard deviation level. We also perform 
direct tests of the correlations in Aj and we find that they are small. This 
justifies our use of the i.i.d. hypothesis. 

The normality assumption is also not strictly true. Here we are saved 
by the fact that the money flux Aj is defined in terms of a logarithm, and 
while it has heavy tails, they are not sufficiently heavy to prevent it from 
converging to a normal. We have explicitly verified this by tracking a group 
of funds in a given size range over time and demonstrating that normality 
is reached within 5 months. Thus even though the normality assumption is 
not true on short timescales it rapidly becomes valid on longer timescales. 

4.3.2 Size dependence of the growth process 

We now test the model for the size dependence of the growth process pro- 



posed in Section 2.3, We also discuss the crucial role of the decomposition 



into returns and money flux in determining the size dependence. 



Figure [3] gives an overview of the size dependence for both the returns A,. 
and the money flux Aj . The two behave very differently. The returns A,, are 



essentially independent of size This is expected based on market efficiency. 



For empirical evidence that investors react to past performance see Remolona et al 



19971 jBussej [2001] [Chevalier, Judith and Ellison, Glenn[ [1997| |Sirri, Erik R. and Tufano,| 



Peter 



1998 



2002 



BoUen 2007 



Guercio, Diane Del and Tkac, Paula A. 
^he independence of the return Ar on size is veritied by performing a linear regression 
of fj,r vs. s for the year 2005, which results in an intercept /3 = 6.7 ± 0.2 x 10~^ and a 
slope a — 0.5 ±8.5 x 10~*. This result implies a size independent average monthly return 
of 0.67%. 
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Figure 3: A summary of the size dependence of mutual fund growth. The 
average mean /i,. and volatihty u^ of fund returns, as well as the average 
fj,f and volatility aj of money flux (i.e. the flow of money in and out of 
funds), are plotted as a function of the fund size (in millions) for the year 
2005 (see Eqs. ([7]-[l0])). The data are binned based on size, using bins with 
exponentially increasing size; we use monthly units. The average monthly 
return /i^ is compared to a constant return of 0.008 and the monthly volatil- 
ity ar is compared to 0.03. The average monthly flux Hf is compared to a 
line of slope of -0.5 and the money flux volatility af is compared to a line of 
slope -0.35. Thus absent any flow of money in or out of funds, performance 
is independent of size, as dictated by market efficiency. In contrast, both 
the mean and the standard deviation of the money flows of funds decrease 
roughly as a power law as a function of size. 
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date 


12/2005 


9/2005 


6/2005 


3/2005 


12/2004 


Pi 


0.10 ±0.16 


0.40 ±0.98 


0.27 ±0.68 


1.17 ±4.68 


-0.23 ±1.24 


h 


0.14 ±0.27 


0.36 ±1.20 


0.48 ± 0.83 


-0.79 ±3.13 


-0.65 ±2.31 


/Sa 


0.28 ±0.45 


0.01 ±1.07 


0.33 ±0.83 


1.79 ±3.24 


0.60 ±2.57 


/34 


0.56 ±0.40 


-0.28 ±0.85 


0.24 ±1.27 


-0.28 ±1.65 


0.44 ±2.32 


/35 


0.24 ±0.43 


-0.25 ±1.13 


0.21 ±0.90 


-0.24 ±2.95 


0.43 ±2.49 


/36 


0.48 ±0.38 


-0.02 ±1.03 


0.30 ±0.92 


1.27 ±3.50 


0.31 ±2.09 


/3 


-0.02 ±0.02 


0.03 ±0.05 


0.01 ±0.05 


0.14 ±0.21 


0.06 ±0.15 



Table II: Cross-sectional regression coefficients of the monthly fund flow, 
computed for several months, against the performance in past months, as 
indicated in Eq. [25} The regression was computed cross-sectionally using 
data for 6189 equity funds. For example the entry for /?i in the first (from 
the left) column represents the linear regression coefficient of the money 
flux at the end of 2005 on the previous month's return. The errors are 95% 
confidence intervals. 



as otherwise one could obtain superior performance simply by investing in 
larger or smaller funds [Malkiel 1995 . This implies that equity mutual funds 



2006 



3- 



can be viewed as a constant return to scale industry Gabaix et al. 
Both the mean [i^ = -E'[Ar] and the standard deviation Or = Var[A, 
are constant; the latter is also expected from market efficiency, as otherwise 
it would be possible to lower one's risk by simply investing in funds of a 
different size. 

In contrast, the money flux Aj decreases with size. Both the mean 
money flux i^f = E[Af] and its standard deviation uj = Var[Aj]^/^ roughly 
follow a power law over five orders of magnitude in the size s. This is 
similar to the behavior that has been observed for the growth rates of other 



Secchi 2003a 



types of firms Stanley et al. 1995, 1996; Amaral et al. , 1997; Bottazzi and 



As already discussed in footnote 9, there is a large body of 
theory attempting to explain this (and we believe our own theory presented 
elsewhere provides the correct explanation [Schwarzkopf et al 2010]). 



As explained in Section |2.3[ the steady state solution is qualitatively 
different depending on whether the parameters ^oo and cr oo in Eq. [24] are 



positive. As can be seen from the fit parameters in Table III based on data 
for A^ alone, we cannot strongly reject the hypothesis that the drift and 
diffusion rates vanish for large sizes, i.e. /ioo — ^ and o"oo -^ 0. However, 
because the size change A^ can be decomposed as Ag = A^ + Aj, efficiency 
dictates that A^ is independent of size, and since E[Ar] > 0, we are confident 
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variable 


1991- 1998 


1991- 2005 


UJQ 


0.14 


-0.37 


o-u 


3.02 


3.16 


O-Q 


0.35 ±0.02 


0.30 ±0.02 


/3 


0.31 ±0.03 


0.27 ±0.02 


0"oo 


0.05 ±0.01 


0.05 ±0.01 


H' 


0.93 


0.96 


Mo 


0.15 ±0.01 


0.08 ±0.05 


a 


0.48 ±0.03 


0.52 ±0.04 


/"oo 


0.002 ± 0.008 


0.004 ±0.001 


H' 


0.98 


0.97 



Table III: Model parameters as measured from the data in different time 
periods, coq and cj^ are the mean and variance of the average (log) size of 
new funds described in Q. (Tq, /3 and o"oo are the parameters for the size 
dependent diffusion and fio, a and fioo are the parameters of the average 



growth rate (24). The confidence intervals are 95% under the assumption of 
standard errors. The adjusted R^ is given for the fits for each period. The 
time intervals were chosen to match the results shown in Fig. [5] 



that neither /ioo nor cJoo are zero. 

As we showed in Table [III the correlation between the returns A^ and the 
money flux Aj is small. This implies that the standard deviations can be 
written as a simple sum. Since A^ is independent of size and both the mean 
and standard deviation of Aj are power laws, this indicates that Eq. 2.3 is 
a good approximation, and that ^oo and o"oo are both greater than zero. As 
illustrated in Figure |4j these functional forms fit the data reasonably well, 
with only slight variations of parameters in different periods, as shown in 
Table imi 



5 Testing the predictions of the model 

In this section we use our calibrated model of the entry, exit and size- 
dependent growth processes to simulate the evolution of the firm size dis- 
tribution through time. We are forced to use a simulation since, once we 
include the size dependence of the diffusion and drift terms as given in 



equation (24), we are unable to find an analytic solution for the general 
diffusion equation (Eq. (pi)). The analytic solution of the size independent 
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Figure 4: An illustration that the empirical power law-based model provides 
a good fit to the distribution of mutual funds, (a) The standard deviation a 
of the logarithmic size change A^ = A (log s) of an equity fund as a function 
of the fund size s (in millions of dollars), (b) The mean // of A^ = A (logs) 
of an equity fund as a function of the fund size s (in millions of dollars). 
The data for all the funds were divided into 100 equally occupied bins, /i is 
the mean in each bin and a is the square root of the variance in each bin for 



the years 1991 to 2005. The data are compared to a fit according to (24) in 
Figures (a) and (b) respectively. 



case (Eq. (|17|)) gives the correct qualitative behavior, but the match is much 
better once one includes the size dependence. 

The simulation was done on a monthly time scale, averaging over 1000 
different runs to estimate the final distribution. As we have emphasized in 
the previous discussion the time scales for relaxation to the steady state 
distribution are long. It is therefore necessary to take the huge increase 
in the number of new funds seriously. We begin the simulation in 1991 
and simulate the process for varying periods of time, making our target the 
empirical distribution for fund size at the end of each period. In each case we 
assume the size distribution for injecting funds is log-normal, as discussed 
in Section l4Tl 

To compare our predictions to the empirical data we measure the pa- 
rameters for fund entry, exit and growth using data from the same period 



as the simulation, summarized in Table III A key point is that we are not 
fitting these parameters on the target data for fund size^^ but rather are 



^* It is not our intention to claim that the processes describing fund size are constant 
or even stationary. Thus, we would not necessarily expect that parameters measured on 
periods outside of the sample period will be a good approximation for those in the sample 
period. Rather, our purpose is to show that the random model for the entry, exit and 
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Figure 5: The model is compared to the empirical distribution at different 
time horizons. The left column compares CDFs from the simulation (full 
line) to the empirical data (dashed line). The right column is a QQ-plot 
comparing the two distributions. In each case the simulation begins in 1991 



and is based on the parameters in Table III The first row corresponds to 
the years 1991-1998 and the second row to the years 1991-2005 (in each case 
we use the data at the end of the quoted year). 



fitting them on the individual entry, exit and diffusion processes and then 
simulating the corresponding model to predict fund size. One of our main 
predictions is that the time dependence of the solution is important. In 
Figure [5] we compare the predictions of the simulation to the empirical data 
at two different ending times. The model fits quite well at all time horizons, 
though the fit in the tail is somewhat less good at the longest time horizon. 
Note, that our simulations make it clear that the fiuctuations in the tail are 
substantial. The deviations between the prediction and the data are thus 
very likely due to chance - many of the individual runs of the simulation 
deviate from the mean of the 1000 simulations more than the empirical data 
does. 



growth processes can explain the distribution of fund sizes. 
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6 Conclusions 

We have argued that the mutual fund size distribution is driven by market 
efficiency, which gives rise to a random growth process. The essential ele- 
ments of the growth process are multiplicative random changes in the size 
of existing funds, entry of new funds, and exit of existing funds as they 
go out of business. We find, however, that entry plays no role at all other 
than setting the scale; exit plays a small role in thickening the tails of the 
distribution, but this acts only on a very slow timescale. The log-normality 
comes about because the industry is young and still in a transient state, and 
the exit process has not had a sufficient time to act. In the future, if the 
conditions for fund growth and exit were to remain stationary for more than 
a century, the distribution would become a power law. The thickening of the 
tails happens from the body of the distribution outward, as the power law 
tail extends to successively larger funds. We suspect that the conditions are 
highly unlikely to remain this stationary, and that the fund size distribution 
will remain indefinitely in its current log-normal, out of equilibrium state. 

There is also an interesting size dependence in the growth rate of mutual 
fund size, which is both like and unlike that of other types of firms. Mutual 
funds are distinctive in that their overall growth rates can be decomposed 
as a sum of two terms. As = Aj -|- A,., where Af represents the flow of 
money in and out of funds, and A^ the returns on money that is already 
in the fund. The money flow Aj decreases as a power law as a function of 
size, similar to what is widely observed in the overall growth rates for other 
types of flrms. Furthermore the exponents are similar to those observed 
elsewhere. The returns A^, in contrast, are essentially independent of fund 
size, as they must be under market efficiency. As a result, for large sizes the 
mean and variance of the overall growth are constant - this is unlike other 
firms, for which the mean and variance appear to go to zero in the limit. As 
we discuss here, this makes a difference in the long-term evolution: While 
the exit process is driving mutual funds to evolve toward a heavier-tailed 
distribution, other firms are evolving toward a thinner-tailed distribution. 
Again, though, due to the extremely slow relaxation times, we suspect this 
makes little or no difference. 

Our analysis here suggests that the details of investor preference have a 
negligible influence on the upper tail of the mutual fund size distribution, 
except insofar as investors choose funds so as to enforce market efficiency. 
Investor preference enters our analysis only through Aj, the flow of money 
in and out of the fund. Since Aj becomes relatively small in the large 
size limit, the growth of large funds is dominated by the returns A,., whose 
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mean and variance are constant. Thus the upper tail of the size distribution 
is determined by market efficiency, which dictates both that returns are 
essentiaUy random, and thus diffusive, and that there is no dependence on 
size. As a result, for large fund size investor preference doesn't have much 
influence on the growth process. This is reinforced by the fact that the 
statistical properties of the money flux Aj are essentially like those of the 
growth of other ffims. 

How can size-dependent transaction costs be compatible with our results 
here? We have performed an empirical study, which we will report elsewhere, 
that demonstrates that as size increases fund managers maintain constant 
after-transaction cost performance by lowering fees, reducing trading and 
diversifying investments. This is in contrast to the theory proposed by 



Berk and Green (2004) that fund size is determined by the skill of fund 
managers, i.e. that better managers attract more investment until increased 
transaction cost causes excess returns to disappear. Both our theory and 
that of Berk and Green are based on market efficiency. The key difference is 
that we find that the flatness of performance vs. size is enforced by simple 
actions taken by fund managers that do not influence the diffusion of fund 
size. In contrast, the Berk and Green theory requires choices by investors 
that directly influence fund size, and thus is not compatible with the free 
diffusion that we have prevented empirical evidence for here. In their theory 
transaction costs and investor skill determine fund size; in our theory, neither 
plays a role. 

We would like to stress that, while we are fitting econometric models to 
the entry, exit and growth processes, and calibrating these models against 
the data, we are not fitting any parameters on the size data itself. This 
makes it challenging to get a model that fits as well as the model shown 
in Fig. [5j Of course, we have only demonstrated that the random process 
model is sufficient to explain fund size; we cannot demonstrate that other 
explanations might not also be able to explain it. However, the assumptions 
that we make here are simple and natural. The stochastic nature of fund 
growth is not surprising: It is well known that past returns do not pre- 
dict future returns. Thus even if investors chase returns, they are chasing 
something that is inherently random. We believe that this is at the core of 
why our model works so well. Our demonstration that a good explanation 
can be obtained based on market efficiency alone, which requires weaker as- 
sumptions than full rationality, provides a theory that is robust and largely 
independent of the details of human choice. 
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Inadequacy of Gini coefficients to characterize 
tail behavior 
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Figure 6: The Gini coefficients as described in equation (26) are calculated 



numerically for a lognormal distribution and a Pareto distribution. The Gini 
Coefficients were calculated for different parameter values and are plotted as 
a function of the resulting standard deviation. For the Pareto distribution 
(footnote 20) we used sq = 0.01 and different exponents a in the range 
(2, 5], i.e. a finite second moment. The lower standard deviation a = 0.0033 
corresponds to a = 5 and a = 1916.17 corresponds to a — )• 2. For the 
lognormal we used a = and different b in the range [0.1, 2.8] where 6 = 0.1 
corresponds to it = 0.101 and b = 2.757 corresponds to a = 2000. 



The Gini coefficient Gini 1912 is commonly used as a measure of in- 



equality but as we show here it is not suitable for distinguishing between 
highly skewed distributions when one wishes to focus on tail behavior. For 
a non negative size s with a CDF F{s), the Gini coefficient G is given by 



G 



1 



E[s] 



F{s){l - F{s))ds, 



(26) 



where E[s] is the mean Dorfman 1979). To illustrate the problem we com- 



19 



pare the Gini coefficients of a Pareto distribution to those of a lognormal 
For a Pareto distribution with tail parameter a the m > a moments do 
not exist. This is in contrast to the lognormal distribution, for which all 



^^ The CDF of the Pareto distribution is defined as 



(27) 



where so is the minimum size and a is the tail exponent. The CDF of a lognormal is given 
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moments exist. Naively one would therefore expect that the Gini coeffi- 
cient of the Pareto distribution (see footnote 18) to be larger than that of 
a lognormal since it has a heavier upper tail. This is true for a Pareto dis- 
tribution with a < 2, for which the Gini coefficient is one due to the fact 
that the standard deviation does not exist. However, when a < 2, for large 
standard deviations the Gini coefficient of the log-normal is greater than 
that of the Pareto, as shown in Figure [6} In order to compare apples to 
apples in Figure [6] we plot the Gini coefficient as a function of the standard 
deviation (which is a function of the distribution parameters) . For a Pareto 
distribution with a finite second moment (q > 2) the lognormal has a higher 
coefficient. 

Thus, even though the Gini coefficient is frequently used as measure for 
inequality, it is not a good measure when one seeks to study tail proper- 
ties, particularly for comparisons of distributions with different functional 
representations. The reason is that the Gini coefficient is a property of the 
whole distribution, and depends on the shape of the body as well as the tail. 
Similar remarks apply to the Herfindahl index. 

B Simulation model 

We simulate a model with three independent stochastic processes. These 
processes are modeled as Poisson process and as such are modeled as having 
at each time step a probability for an event to occur. The simulation uses 
asynchronous updating to mimic continuous time. At each simulation time 
step we perform one of three events with an appropriate probability. These 
probabilities will determine the rates in which that process occurs. The 
probability ratio between any pair of events should be equal to the ratio of 
the rates of the corresponding processes. Thus, if we want to simulate this 
model for given rates our probabilities are determined. 
These processes we simulate are: 

1. The rate of size change taken to be 1 for each fund and N for the 
entire population. 

Thus, each fund changes size with a rate taken to be unity. 

2. The fund exit rate A which can depend on the fund size. 

log(s) - a 



by 



Finis) = ^ ( 1 + Erf 



V2b 
where a is a location parameter and b is the scale parameter 
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3. The rate of creation of new funds i/. 

Each new fund enters with a size uj with a probabihty density /(w). 

Since some of these processes are defined per firm as opposed to the 
creation process, the simulation is not straightforward. We offer a brief 
description of our simulation procedure. 

1. At every simulation time step, with a probability ^^^ a new fund 
enters and we proceed to the next simulation time step. 

2. If a fund did not enter then the following is repeated (1 + A)A^ times. 

a. We pick a fund at random. 

b . With a probability of jrj the fund enters. 

c. If it is not annihilated, which happens with a probability of jxt; 
we change the fund size. 

We are interested in comparing the simulations to both numerical and 
empirical results. The comparisons with analytical results are done for spe- 
cific times and for specific years when comparing to empirical data. In order 
to do so, we need to convert simulation time to "real" time. The simulation 
time can be compared to 'real' time if every time a fund does not enter 
we add a time step. Because of the way we defined the probabilities each 
simulation time step is comparable to 1/(1 + A) in "real" time units. The 
resulting "real" time is then measured in what ever units our rates were 
measured in. In our simulation we use monthly rates and as such a unit 
time step corresponds to one month. 
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